1.  Introduction

The  theory  of  evolution  is  a  field  of  research  in  which  many  mathematical 
investigations  have  been  made  and  in  which  many  unsolved  problems  remain. 
Whatever  one’s  opinions  are  about  the  ultimate  causes  of  evolutionary  progress 
(a still debatable problem), it is now clear that an overwhelming part in
determining the direction of evolution is played by selection and mutation, and
furthermore,  that  the  only  empirically  testable  theory  of  evolution  that  has  at 
the  present  time  any  plausibility  is  that  evolution  is  the  result  of  these  two 
factors. 

In  the  Proceedings  of  the  Second  Berkeley  Symposium,  Professor  W.  Feller 
gave  a  lecture  on  “Diffusion  processes  in  genetics”  in  which  he  drew  attention 
to  the  very  interesting  stochastic  processes  constructed  and  studied  by  Wright 
and  Fisher  which  attempt  to  describe  and  explain  what  happens  to  the  genetical 
structure  of  populations  when  we  take  account  of  the  fact  that  the  population 
has a finite size so that what happens at birth, mating, and death is not
determinate but has a random character. This lecture has stimulated a good deal of
further  research  on  such  random  processes  (see  Moran  [14]  and  more  recent 
papers  of  Karlin,  McGregor,  and  Ewens). 

So  long  as  the  population  size  is  effectively  finite,  use  of  the  theory  of  random 
processes  is  essential.  In  this  way  we  can  study  such  problems  as  the  rate  at 
which  a  population  becomes  homozygous  at  some  locus  when  there  is  no  mutation 
(a  phenomenon  called  “Drift”  by  Sewall  Wright  and  which  he  holds,  I  think 
incorrectly,  to  be  of  importance  in  evolution),  the  stationary  distributions  of 
gene  frequencies  when  there  is  mutation,  and  the  probability  of  survival  of  new 
mutants. 

Although  these  problems  must  remain  of  great  interest  to  students  of  stochastic 
processes, I believe that the study of the deterministic processes which effectively
describe what happens when the population size is large is more important
from the evolutionary point of view. This is due to the fact that the theory of
stochastic  genetic  processes  shows  that  the  influence  of  the  random  element  is 
of  the  same  order  as  the  reciprocal  of  the  population  size.  Thus,  for  example, 
for populations of size greater than $10^4$, deterministic theory is sufficient to
answer nearly all problems concerning selection in which the selection
coefficients are, say, greater than $10^{-3}$. Similarly, in studying the effect of mutation
we can use deterministic theory provided the population size is substantially
greater  than  the  reciprocal  of  the  mutation  rate  (except,  of  course,  for  the 
special  problem  of  finding  the  probability  of  survival  of  a  single  new  mutant). 

Most  natural  populations  probably  satisfy  these  conditions,  as  is  illustrated 
by  the  extreme  case  of  most  species  of  insects,  although  occasionally  there  are 
some small isolated populations whose size is sufficiently small to require
consideration of the stochastic element. This is illustrated, for example, by laboratory
populations or by the populations of the grasshopper, Moraba scurra, in the
graveyards  of  Australian  country  towns,  which  have  been  extensively  studied 
by  M.  J.  D.  White. 

It  is  of  some  importance  to  note  that  for  the  stochastic  element  to  be  important 
from  the  evolutionary  point  of  view,  such  populations  must  be  completely 
isolated.  It  can  be  shown  (Moran  [13])  that  conclusions  about  the  stochastic 
behavior  of  large  populations  are  unaffected  if  these  populations  are  divided 
into  many  smaller  populations,  provided  there  is  a  small  but  nonzero  amount 
of  migration  between  them.  The  effective  population  size  is  then  practically 
equal  to  the  whole  population. 

Our  aim,  therefore,  in  this  paper  is  to  consider  the  theory  of  evolution  in 
populations  large  enough  for  stochastic  variations  to  be  ignored.  Since  we  are 
then  concerned  with  average  numbers  or  proportions  of  individuals  of  a  specific 
type,  the  subject  can  still  be  rightly  regarded  as  a  part  of  statistical  analysis. 

We define the state of such a population at a given time, or in the case of
nonoverlapping generations in a given generation, by a finite set of quantities
$x_1, \ldots, x_k$. It is often convenient to write this set in vector form,
$x = (x_1, \ldots, x_k)$, although it does not have the algebraic properties of a
vector. We attach a suffix, $t$, so that $x_t$ is the state in the $t$th generation,
or if we are considering a model which evolves continuously in time, we write
$x(t)$ where $t$ is a continuous variable.

The  assumed  properties  of  the  population  then  enable  us  to  write  down  a 
relation between the values of $x$ for one generation (the parent generation) and
the corresponding values in the next generation. We therefore begin by
considering the properties of recurrence relations of the form

(1)  $x_{t+1} = f(x_t) = (f_1(x_t), \ldots, f_k(x_t)).$

In  most  cases  the  components  of  x  will  be  the  relative  proportions  of  genetic 
individuals  of  various  types  (genes  or  zygotes)  and  will  therefore  add  to  unity 
and be nonnegative. Equality (1) is then the mapping of the interior of a
$(k - 1)$-dimensional simplex onto itself. This is the case considered in this paper.

2.  Recurrence  relations  and  their  related  differential  equations 

We  first  look  for  the  stationary  states  of  the  system  which  will  be  given  by 
the relevant solutions of the equation obtained from (1) by putting
$x_t = x_{t+1} = x$, which is thus

(2)  $x = f(x).$

It  is  possible  for  the  solutions  of  this  equation  to  consist  of  isolated  points,  lines 
in the space of the $(x_i)$, or the whole region in which $x$ is defined. Any solution
of (2) will be said to be a “stationary point” if $\sum_i x_i = 1$ and all $x_i \ge 0$,
and if we have further that all the $x_i$ are greater than zero, we shall say it is an “internal”
stationary  point.  In  most  cases  we  will  be  concerned  with  the  number  and 
position  of  isolated  stationary  internal  points. 

We  then  have  to  consider  the  local  stability  of  the  system  for  small  variations 
about the stationary values. Write $x_t = x + y_t$ where $x$ is a solution of (2) and
$y_t$ is a vector $(y_{1,t}, \ldots, y_{k,t})$ of small quantities adding to zero. Then
under suitable conditions on the function $f(x)$,

(3)  $x_i + y_{i,t+1} = f_i(x) + \sum_j \frac{\partial f_i}{\partial x_j} y_{j,t} + O\left(\sum_j y_{j,t}^2\right).$

It  is  clear  that  a  sufficient  condition  for  the  point  x  to  be  locally  stable  is  that 
all the roots of the matrix

(4)  $\left( \partial f_i / \partial x_j \right)$,  evaluated at $x$,

should  have  moduli  less  than  unity,  and  a  sufficient  condition  for  instability 
is  that  at  least  one  of  the  roots  has  modulus  greater  than  unity.  When  all  the 
moduli  are  equal  to  unity  the  situation  becomes  more  difficult.  As  an  example 
of this, if we take $w_1 = 1$, $w_2 = w_3 = 2$ in the equation (9) discussed later, we
obtain a recurrence relation of the form

(5)  $x_{t+1} = f(x_t) = \frac{x_t (2 - x_t)}{2 - x_t^2},$

where $0 \le x_t \le 1$. Here $x_t = 0$ is a stationary point, but $f'(0) = 1$ and the
above criterion does not apply, although this point is clearly stable since
$x_{t+1} < x_t$ for $0 < x_t < 1$.
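
The borderline case can be checked numerically. The following is a minimal sketch in Python, assuming that recurrence (5) is equation (9) with $w_1 = 1$, $w_2 = w_3 = 2$, that is $f(x) = x(2 - x)/(2 - x^2)$. The names `f`, `iterate`, and `central_derivative`, the starting value 0.5, and the number of steps are illustrative choices, not part of the original argument.

```python
def f(x):
    """Recurrence (5), assumed to be equation (9) with w1 = 1, w2 = w3 = 2."""
    return x * (2.0 - x) / (2.0 - x * x)


def iterate(x0, steps):
    """Return the list [x_0, x_1, ..., x_steps] generated by x_{t+1} = f(x_t)."""
    xs = [x0]
    for _ in range(steps):
        xs.append(f(xs[-1]))
    return xs


def central_derivative(func, x, h=1e-6):
    """Symmetric finite-difference estimate of func'(x)."""
    return (func(x + h) - func(x - h)) / (2.0 * h)


xs = iterate(0.5, 1000)          # illustrative starting point and horizon
slope_at_zero = central_derivative(f, 0.0)

print("f'(0) is approximately", slope_at_zero)
print("x after 1000 generations:", xs[-1])
```

Under these assumptions the iterates decrease at every step, but only at an algebraic rate (roughly like $2/t$), which is what one would expect when the linear term gives no contraction.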

It  is  sometimes  easier  to  consider  only  the  behavior  of  the  population  in  the 
neighborhood  of  the  boundary  of  the  region.  Starting  from  a  boundary  point 
and introducing a perturbation of an allowed kind (for example, all $x_i \ge 0$),
it  is  often  fairly  easy  to  show  that  all  boundary  points  are  unstable.  This  makes 
it  likely  that  there  is  a  stable  stationary  point  in  the  interior,  although  it  does 
not  prove  it  since  there  remains  the  theoretical  possibility  of  cyclic  behavior. 
This  approach  has  been  extensively  developed  by  Bodmer  [3]  (see  also  Bodmer 
and  Parsons  [4],  [5]),  who  was  primarily  interested  in  the  initial  progress  and 
survival  of  new  mutant  genes,  and  I  have  used  it  elsewhere  (Moran  [17])  to 
discuss  a  model  in  which  I  was  unable  to  prove  directly  the  existence  of  a  stable 
stationary  interior  state. 

Associated  with  equation  (1),  there  is  a  related  theory  for  continuous  time 
using  differential  equations.  This  is  most  simply  approached  if  we  suppose  that 
the  components  of  x  are  the  relative  frequencies  of  elements  of  k  different 
kinds in some population. We assume that in a small interval of time $(t, t + dt)$,
a fraction, $dt$, of the population dies and is replaced by $dt$ new offspring of types
whose relative frequency is given by the components of $f(x(t))$, where $x(t)$ is
the vector giving the structure of the population at time $t$, the notation being
changed to denote that $t$ is now continuous. We then have
$x(t + dt) = (1 - dt)\,x(t) + dt\, f(x(t))$, from which we derive the system of
differential equations

(6)  $\frac{dx(t)}{dt} = -x(t) + f(x(t)).$

The  stationary  states  are  again  given  by  the  solutions  of  (2).  Contrary  to  what 
we  might  expect,  we  shall  see  later  that  (6)  does  not  always  correspond  to  the 
natural  model  of  a  genetic  population  evolving  continuously  in  time  unless  the 
components $x_i(t)$ of $x(t)$ are such as to give a complete description of the state
of the system. Nevertheless, (6) can be used as an approximation to the behavior
of (1), has the same stationary points, and has a closely related stability
theory.

Let  x  be  a  solution  of  (2)  and  write  x(t)  =  x  +  y(t)  where  y(t)  is  a  vector  of 
small  components  representing  a  “variation”  or  perturbation  around  the  point  x. 
The  asymptotic  behavior  of  y(t)  will  provide  us  with  a  theory  of  the  stability 
of  the  system  (6)  around  the  point  x  (for  a  general  survey  of  the  analytical 
theory,  see  Cesari  [6]). 

Under suitable conditions on $f(x(t))$, we can immediately derive the
“variational” equations for $y(t)$ which are

(7)  $\frac{dy_i(t)}{dt} = -y_i(t) + \sum_j \frac{\partial f_i}{\partial x_j} y_j(t).$

Then all $y_i(t) \to 0$ if the roots of the matrix (4) are such that their real parts
are less than unity.

Stability  in  the  above  sense  is  a  “local”  stability.  Given  a  stationary  point  x 
we can associate with it the set of all points $P$, such that starting from $P$ the
population ultimately converges to $x$. This may be called the domain of
attraction of $x$. If the domain of attraction of an internal point $x$ contains all internal
points of the simplex, we say that $x$ is “globally” stable.

3.  Selection  dependent  on  a  single  locus  with  two  alleles 

Consider  a  model  of  a  genetic  population  in  which  there  are  two  alleles,  A 
and  a,  at  a  single  locus  such  that  the  “fitness”  of  a  newly  formed  individual 
zygote  is  dependent  on  this  locus  alone.  There  are  three  possible  types  of  zygote, 
$AA$, $Aa$, and $aa$, and we define their fitnesses to be nonnegative numbers $w_1$,
$w_2$, $w_3$ proportional to the numbers of gametes that a zygote of these types,
newly formed by union of gametes, will contribute to the formation of the next
generation. We assume that successive generations are nonoverlapping and that
mating is random, so that the probabilities of formation of zygotes $AA$, $Aa$,
and $aa$ will be $p(A)^2$, $2p(A)p(a)$, $p(a)^2$ respectively, where $p(A)$ and
$p(a) = 1 - p(A)$ are the relative frequencies of the $A$ and $a$ gametes entering into the
formation of the zygotic population. Let $p = 1 - q$ be the frequency $p(A)$ in
the  first  (parent)  generation  which  we  suppose  to  have  been  produced  in  this 
manner.  Then  the  mean  fitness  of  this  population  is 

(8)  $W = w_1 p^2 + 2 w_2 pq + w_3 q^2.$

It  is  important  to  notice  that  in  this  discrete  model  the  w’s  are  not  simply 
the relative fertilities of the adult zygotes, but they also depend on the
probabilities that the newly formed zygotes survive to the age of reproduction.

If  p'  is  the  frequency  of  A  in  the  next  generation,  we  have 


(9)  $p' = p\, \frac{w_1 p + w_2 q}{W},$

an equation of type (1). This can also be written

(10)  $p' - p = pq\, \frac{(w_1 - w_2)p + (w_2 - w_3)q}{W} = \frac{pq}{2W} \frac{dW}{dp},$

where  the  derivative  of  W  is  calculated  remembering  that  q  =  1  —  p.  The 
stationary  points  are  therefore  given  by  equating  the  right-hand  side  of  (10) 
to zero and thus, either $p = 0$, or $q = 0$, or the derivative of $W$ is zero for some
internal point satisfying $0 < p < 1$. It is further easily seen that for such an
internal point to be locally stable, we must have

(11)  $\frac{d^2 W}{dp^2} < 0.$

If this is so, $2w_2 - w_1 - w_3 > 0$ and also $w_2 > w_1$, $w_2 > w_3$. The internal point
is then stable for any allowable deviations, and the points $p = 0, 1$ are unstable
stationary points.

Thus if $W$ attains a maximum at an internal stable stationary point, its
value there is

(12)  $W = \frac{w_2^2 - w_1 w_3}{2w_2 - w_1 - w_3},$

and $p$ is given by

(13)  $p = \frac{w_2 - w_3}{2w_2 - w_1 - w_3}.$

Thus  the  population  moves  to  maximize  the  mean  fitness  W.  Notice  that  the 
position and stability of such an internal point is unaltered if we transform the $w_i$
into another set of values by the transformation $w_i \to \lambda w_i + \mu$, where $\lambda > 0$,
$\mu \ge -\lambda \min w_i$, but the rate of convergence of the population to its equilibrium
value is altered unless $\mu = 0$. It is also not hard to show that in this simple case
the  gene  frequency  changes  monotonically  from  its  initial  value  to  its  final  value 
and  does  not  overshoot  the  latter. 
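
These statements are easy to check numerically. The sketch below iterates recurrence (9) and compares the limit with (12) and (13). It is only an illustration: the fitness values 0.8, 1.0, 0.6 (a heterozygote advantage), the starting frequency, and the function names are assumptions chosen for the example and do not come from the text.

```python
def mean_fitness(p, w1, w2, w3):
    """Mean fitness W of equation (8) for gene frequency p of A."""
    q = 1.0 - p
    return w1 * p * p + 2.0 * w2 * p * q + w3 * q * q


def next_p(p, w1, w2, w3):
    """One generation of selection, equation (9)."""
    q = 1.0 - p
    return p * (w1 * p + w2 * q) / mean_fitness(p, w1, w2, w3)


def trajectory(p0, w1, w2, w3, generations):
    """Gene frequencies p_0, p_1, ..., p_generations."""
    ps = [p0]
    for _ in range(generations):
        ps.append(next_p(ps[-1], w1, w2, w3))
    return ps


# Illustrative fitnesses with w2 > w1 and w2 > w3 (hypothetical values).
w1, w2, w3 = 0.8, 1.0, 0.6
ps = trajectory(0.05, w1, w2, w3, 200)
fitnesses = [mean_fitness(p, w1, w2, w3) for p in ps]

p_star = (w2 - w3) / (2.0 * w2 - w1 - w3)            # equation (13)
w_star = (w2 * w2 - w1 * w3) / (2.0 * w2 - w1 - w3)  # equation (12)

print("limit of p:", ps[-1], "predicted:", p_star)
print("limit of W:", fitnesses[-1], "predicted:", w_star)
```

With these values the frequency rises monotonically towards $p = 2/3$ without overshooting, and the mean fitness never decreases along the way.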

Now  consider  the  effect  of  mutation  on  a  stationary  state  of  the  population. 
Suppose  first  that  in  the  absence  of  mutation  there  is  a  stationary  stable  internal 
state (that is, one with $0 < p < 1$). Let $A$ mutate to $a$ at a rate $\alpha_1$ per gamete
per generation, and $a$ to $A$ at a rate $\alpha_2$. The rates $\alpha_1$ and $\alpha_2$ will usually not be
larger than $10^{-4}$. Let $p_0 = 1 - q_0$ be the gene frequency of $A$ in the absence of
mutation so that $p_0$ is given by (13). Let selection in the production of the
gametes occur first and then mutation. If $p$ is the equilibrium gene frequency,
we therefore have $Wp = p(1 - \alpha_1)(w_1 p + w_2 q) + q\alpha_2(w_2 p + w_3 q)$, from which
we get, to a sufficiently good approximation,


(14)  $p = \frac{w_2 - w_3}{2w_2 - w_1 - w_3} - \frac{(w_1 p_0 + w_2 q_0)(\alpha_1 p_0 - \alpha_2 q_0)}{p_0 q_0 (2w_2 - w_1 - w_3)} = p_0 - \frac{W(\alpha_1 p_0 - \alpha_2 q_0)}{p_0 q_0 (2w_2 - w_1 - w_3)}.$


The effect on the fitness, $W$, is therefore to lower it by an amount (ignoring
terms of higher order) equal to

(15)  $-\frac{W^2 (\alpha_1 p_0 - \alpha_2 q_0)^2}{2 p_0^2 q_0^2 (2w_2 - w_1 - w_3)^2} \frac{d^2 W}{dp^2} = \frac{(w_2^2 - w_1 w_3)^2 (\alpha_1 p_0 - \alpha_2 q_0)^2}{p_0^2 q_0^2 (2w_2 - w_1 - w_3)^3}.$

This is of the second order in the $\alpha$’s, and thus mutation has little effect on
fitness dependent on a single locus at which there is a stable internal
polymorphism.

The  situation  is  different  when  there  is  no  internal  stable  point.  Consider  the 
particular case $w_1 < w_2 < w_3$. Then the only stable stationary case is given by
$p = 0$. The mean fitness $W$ is a maximum for this value, but because it occurs
at the end of the range, its derivative is not in general zero. The mutation of $A$
to $a$ will have little effect, so we put $\alpha_1 = 0$ and write $\alpha$ for $\alpha_2$. The stationary
points are then the solutions of $Wp = p(w_1 p + w_2 q) + q\alpha(w_2 p + w_3 q)$. One
solution is $p = 1$, and the other is very close to $p = \alpha w_3/(w_3 - w_2)$, if we assume
$\alpha$ small compared with $w_3 - w_2$. Since

(16)  $\frac{dW}{dp} = 2(w_1 p + w_2(q - p) - w_3 q),$

which is equal to $2(w_2 - w_3)$ at $p = 0$, the decrease in fitness is about $2w_3\alpha$,
which is thus of the same order as the mutation rate.
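
The first-order effect of mutation in the absence of a balanced polymorphism can be illustrated numerically. The following sketch iterates selection followed by mutation from $a$ to $A$ at rate $\alpha$ (with $\alpha_1 = 0$) until the gene frequency settles, and compares the result with the approximations $p \approx \alpha w_3/(w_3 - w_2)$ and a fitness decrease of about $2 w_3 \alpha$. The numerical values of the fitnesses and of $\alpha$, and all function names, are assumptions made for the example.

```python
def mean_fitness(p, w1, w2, w3):
    """Mean fitness W of equation (8)."""
    q = 1.0 - p
    return w1 * p * p + 2.0 * w2 * p * q + w3 * q * q


def next_p_with_mutation(p, w1, w2, w3, alpha):
    """Selection first, then mutation a -> A at rate alpha (alpha_1 = 0)."""
    q = 1.0 - p
    W = mean_fitness(p, w1, w2, w3)
    return (p * (w1 * p + w2 * q) + alpha * q * (w2 * p + w3 * q)) / W


def equilibrium(p_start, w1, w2, w3, alpha, generations=5000):
    """Iterate the recurrence long enough to reach the stable equilibrium."""
    p = p_start
    for _ in range(generations):
        p = next_p_with_mutation(p, w1, w2, w3, alpha)
    return p


# Hypothetical directional selection w1 < w2 < w3 and mutation rate.
w1, w2, w3, alpha = 0.8, 0.9, 1.0, 1e-5

p_eq = equilibrium(0.01, w1, w2, w3, alpha)
p_approx = alpha * w3 / (w3 - w2)

# Without mutation the population would sit at p = 0 with W = w3.
drop = w3 - mean_fitness(p_eq, w1, w2, w3)

print("equilibrium p:", p_eq, "approximation:", p_approx)
print("fitness decrease:", drop, "approximation:", 2.0 * w3 * alpha)
```

For these values both approximations agree with the iterated equilibrium to well within one percent, and the loss of fitness is of the same order as $\alpha$ itself.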

It  follows  that  if  we  consider  a  single  locus,  decrease  in  fitness  due  to  mutation 
will  only  be  important  in  the  absence  of  a  selectively  balanced  polymorphism. 
We  shall  see  later  that  when  fitness  is  controlled  by  more  than  one  locus,  the 
effect  of  mutation  may  be  of  the  first  instead  of  the  second  order  at  a  stable 
stationary  internal  point,  because  fitness  dependent  on  more  than  one  locus  is 
not  in  general  a  maximum  at  such  a  point. 


4.  Selection  dependent  on  a  single  locus  with  more  than  two  alleles 

The  above  theory  has  been  known  for  many  years.  More  recently  attention 
has  been  devoted  to  selection  dependent  on  a  single  locus  at  which  there  are 
more  than  two  alleles,  and  here  the  situation  is  not  quite  so  simple.  We  assume 
again  that  we  have  nonoverlapping  generations  of  diploid  individuals  who  mate 
at random so that the frequencies of the various classes of zygotes in the next
generation are proportional to the products of the frequencies of the gametes.
Let there be $k$ alleles $A_1, \ldots, A_k$ at the given locus. Then we represent the
zygotes by the symbols $(A_i A_j)$ $(i, j = 1, \ldots, k)$. There are clearly $\frac{1}{2}k(k + 1)$
different pairs, but it is convenient algebraically to draw a distinction between
$(A_i A_j)$ and $(A_j A_i)$ when $i \ne j$, while keeping their frequencies equal. This
could be done, for example, by conventionally supposing that the first of the
two alleles is derived from the father and the second from the mother. We shall
not, however, consider models in which the gene frequencies or selection
coefficients are different in the two sexes, although such models are of interest in
some  special  cases. 

Let the gene frequency of $A_i$ among the gametes which formed the first
generation be $p_i$ ($\sum_i p_i = 1$). Then using the above convention and the fact that
mating is random, the frequency of newly formed $(A_i A_j)$ in the first generation
will be $p_i p_j$. The fitness of $(A_i A_j)$ is defined to be a nonnegative number
$w_{ij} = w_{ji}$ proportional to the number of gametes this newly formed zygote will
contribute to the next generation. Then the mean fitness of the first generation is

(17)  $W = \sum_{ij} w_{ij} p_i p_j,$

where the sum is taken over all values of $i$ and $j$.

The numbers of $A_i$ in the gametic product of this generation will be
proportional to

(18)  $\sum_j w_{ij} p_i p_j + \sum_k w_{ki} p_k p_i = 2 p_i \sum_j w_{ij} p_j.$

The sum of all such expressions is $2W$, and if we write $p_i'$ for the frequency of $A_i$
in the next generation, we have

(19)  $p_i' = p_i W^{-1} \sum_j w_{ij} p_j.$

The mean fitness, $W'$, of the next generation is

(20)  $W' = \sum_{ij} w_{ij} p_i' p_j' = W^{-2} \sum_{ijkl} w_{ij} w_{ik} w_{lj} p_i p_j p_k p_l.$

It is a remarkable fact that $W' \ge W$, so that the mean fitness does not
decrease  from  generation  to  generation.  This  was  first  proved  by  Scheuer  and 
Mandel  [22],  and  by  Mulholland  and  Smith  [18].  Later  proofs  were  given  by 
Atkinson,  Watterson,  and  Moran  [1],  and  by  Kingman  [9],  [10].  The  last  proof 
is  particularly  simple  and  also  makes  clear  the  conditions  under  which  W'  =  W, 
so  that  we  give  it  here. 

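To do this we use repeatedly the inequality

(21)  $\frac{\sum_i a_i x_i^k}{\sum_i a_i} \ge \left( \frac{\sum_i a_i x_i}{\sum_i a_i} \right)^k,$

where $a_i \ge 0$, $\sum_i a_i > 0$, $x_i \ge 0$, $k \ge 1$. We have, on renumbering the suffices,

(22)  $\sum_{ijkl} w_{ij} w_{ik} w_{lj} p_i p_j p_k p_l = \frac{1}{2} \sum_{ijk} w_{ij} w_{ik} \left( \sum_l w_{lj} p_l + \sum_m w_{mk} p_m \right) p_i p_j p_k$
      $\ge \sum_{ijk} w_{ij} w_{ik} \left( \sum_{lm} w_{lj} w_{mk} p_l p_m \right)^{1/2} p_i p_j p_k,$

on using the inequality between arithmetic and geometric means. This in turn
is equal to

(23)  $\sum_i p_i \left\{ \sum_j w_{ij} p_j \left( \sum_l w_{lj} p_l \right)^{1/2} \right\}^2 \ge \left\{ \sum_i p_i \sum_j w_{ij} p_j \left( \sum_l w_{lj} p_l \right)^{1/2} \right\}^2,$  by (21).

This can be written as $\left\{ \sum_j p_j \left( \sum_l w_{lj} p_l \right)^{3/2} \right\}^2$, and using (21) again, this is not
less than

(24)  $\left\{ \left( \sum_{lj} w_{lj} p_l p_j \right)^{3/2} \right\}^2 = \left( \sum_{ij} w_{ij} p_i p_j \right)^3.$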

The sequence of inequalities is then equivalent to $W' \ge W$, and equality only
occurs at an internal point (all $p_i > 0$) if

(25)  $\sum_j w_{lj} p_j = \sum_{ij} w_{ij} p_i p_j$  for all $l$.

This last equation is equivalent to the condition for stationarity that we get
when we put $p_i' = p_i > 0$ in (19). Thus the mean fitness must increase if the
initial point is an internal point which is not stationary, and a stable stationary
internal point must be a local maximum of $W$.
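
The inequality $W' \ge W$ can be observed directly by iterating recurrence (19). The sketch below does this for a randomly generated symmetric fitness matrix. The number of alleles, the range of the fitnesses, the random seed, and the function names are all illustrative assumptions; the code checks the statement on one example and is not a substitute for the proof.

```python
import random


def marginal_fitnesses(w, p):
    """Return the list of sums over j of w[i][j] * p[j], one entry per allele i."""
    k = len(p)
    return [sum(w[i][j] * p[j] for j in range(k)) for i in range(k)]


def mean_fitness(w, p):
    """Mean fitness W of equation (17)."""
    return sum(pi * mi for pi, mi in zip(p, marginal_fitnesses(w, p)))


def next_generation(w, p):
    """Gene frequencies in the next generation, equation (19)."""
    m = marginal_fitnesses(w, p)
    W = sum(pi * mi for pi, mi in zip(p, m))
    return [pi * mi / W for pi, mi in zip(p, m)]


def random_symmetric_fitnesses(k, rng):
    """A hypothetical symmetric k x k fitness matrix with entries in [0.1, 1]."""
    w = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i, k):
            w[i][j] = w[j][i] = rng.uniform(0.1, 1.0)
    return w


rng = random.Random(0)           # fixed seed so the run is repeatable
k = 4
w = random_symmetric_fitnesses(k, rng)
raw = [rng.random() + 0.01 for _ in range(k)]
p = [r / sum(raw) for r in raw]  # an internal starting point

history = []
for _ in range(50):
    history.append(mean_fitness(w, p))
    p = next_generation(w, p)

print("mean fitness, first and last generation:", history[0], history[-1])
```

In runs of this kind the recorded mean fitness is nondecreasing from one generation to the next, up to rounding error, as the theorem requires.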

We  can  look  at  this  in  another  way.  Consider  the  stationary  points  of  W 
subject to the condition that $\sum_i p_i = 1$. Since $W$ is a quadratic form in the $p_i$,
finding its stationary points under this condition is equivalent to finding the
stationary values of $W/(\sum_i p_i)^2$ when the $p_i$ have arbitrary nonnegative
values. Then

(26)  $\frac{\partial}{\partial p_i} \left\{ \frac{W}{(\sum_j p_j)^2} \right\} = \left( \sum_j p_j \right)^{-3} \left\{ \sum_j p_j \frac{\partial W}{\partial p_i} - 2W \right\}.$

Putting this equal to zero, we again obtain the conditions $\sum_j w_{ij} p_j = W$, for
all $i$, which, by (19), must hold at any internal stationary point. The fact that
$W$ is a quadratic form implies that if there is an isolated stationary point, there
is only one such point.

The  conditions  under  which  such  an  internal  stationary  point  is  stable  have 
been  discussed  by  Mandel  [12],  Kimura  [7],  Penrose,  Smith,  and  Sprott  [21], 
and  Kingman  [10]. 

We  shall  see  later  that  the  principle  that  the  mean  fitness  of  a  population  tends 
to  increase  until  it  reaches  a  maximum  holds  in  general  only  for  fitness  dependent 
on  a  single  locus  and  is  not  a  general  principle  of  evolutionary  theory. 


5.  The single locus problem with continuous time

Associated with equations (19), we naturally have a continuous time model
in $p_i(t)$, the frequency of $A_i$ at time $t$, satisfying the differential equations

(27)  $\frac{dp_i(t)}{dt} = -p_i(t) + W(t)^{-1} p_i(t) \sum_j w_{ij} p_j(t),$

where $W(t) = \sum_{ij} w_{ij} p_i(t) p_j(t)$.

If the $p_i(t)$ satisfy this equation, it is now much easier to prove that $W(t)$
cannot decrease, for we have

(28)  $\frac{1}{2} \frac{dW(t)}{dt} = \sum_{ij} w_{ij} p_i(t) \left\{ -p_j(t) + W(t)^{-1} p_j(t) \sum_k w_{jk} p_k(t) \right\}$
      $= W(t)^{-1} \left\{ \sum_{ij} w_{ij} p_i(t) p_j(t) \sum_k w_{jk} p_k(t) - W(t)^2 \right\}$
      $= W(t)^{-1} \left\{ \sum_j p_j(t) \left( \sum_k w_{jk} p_k(t) \right)^2 - \left[ \sum_j p_j(t) \sum_k w_{jk} p_k(t) \right]^2 \right\} \ge 0,$

because of the inequality $\sum_i p_i a_i^2 \ge (\sum_i p_i a_i)^2$ for $p_i \ge 0$, $\sum_i p_i = 1$, $a_i \ge 0$. The
mean fitness of those zygotes which contain $A_j$ is $\sum_k w_{jk} p_k(t)$. Thus the rate of
change of $W(t)$ on this model is proportional to the variance of these quantities.
This is Fisher’s so called “Fundamental Theorem of Natural Selection.”
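
The identity behind (28) can be verified numerically at any chosen point of the simplex. The sketch below evaluates the right-hand side of (27), computes $dW/dt$ by the chain rule, and compares it with twice the variance of the marginal fitnesses divided by $W$. The fitness matrix and the gene frequencies are arbitrary illustrative values, and the function names are assumptions.

```python
def marginal_fitnesses(w, p):
    """Mean fitness of the zygotes containing each allele."""
    k = len(p)
    return [sum(w[j][i] * p[i] for i in range(k)) for j in range(k)]


def mean_fitness(w, p):
    """W(t) as defined after equation (27)."""
    return sum(pj * mj for pj, mj in zip(p, marginal_fitnesses(w, p)))


def frequency_rates(w, p):
    """Right-hand side of the differential equations (27)."""
    m = marginal_fitnesses(w, p)
    W = mean_fitness(w, p)
    return [-pj + pj * mj / W for pj, mj in zip(p, m)]


def fitness_rate(w, p):
    """dW/dt by the chain rule: twice the sum of marginal fitness times dp/dt."""
    m = marginal_fitnesses(w, p)
    dp = frequency_rates(w, p)
    return 2.0 * sum(mj * dpj for mj, dpj in zip(m, dp))


def variance_formula(w, p):
    """Twice the variance of the marginal fitnesses, divided by W."""
    m = marginal_fitnesses(w, p)
    W = mean_fitness(w, p)
    variance = sum(pj * mj * mj for pj, mj in zip(p, m)) - W * W
    return 2.0 * variance / W


# Hypothetical symmetric fitnesses and an internal point of the simplex.
w = [[1.0, 0.7, 0.4],
     [0.7, 0.9, 1.2],
     [0.4, 1.2, 0.5]]
p = [0.2, 0.5, 0.3]

rate = fitness_rate(w, p)
from_variance = variance_formula(w, p)
total_rate = sum(frequency_rates(w, p))

print("dW/dt from (27):", rate)
print("2 * variance / W:", from_variance)
```

The two quantities agree to rounding error, and the rates of change of the gene frequencies sum to zero, so that the frequencies continue to add to unity.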

Unfortunately,  however,  equations  (27)  do  not  correspond  to  a  natural  model  of 
an  evolving  population  and  can  be  regarded  only  as  an  approximation  useful  when 
the $w_{ij}$ are nearly all equal. The reason for this is that although mating remains
always “at random,” the frequencies of zygotes $(A_i A_j)$ do not remain equal to
the  products  of  the  gene  frequencies,  even  if  they  start  that  way. 

To see this, consider a population of zygotes $(A_i A_j)$ whose frequencies are
$p_{ij}(t) = p_{ji}(t)$, and are such that initially (for example, at $t = 0$) they are equal
to $p_i(t) p_j(t)$. In any small interval of time $(t, t + dt)$ we suppose that a fraction,
$dt$, of the population of zygotes dies, and is replaced by $dt$ offspring produced by
random mating among the parents existing at time $t$, due account being taken of
the selective differences, $w_{ij}$, which now have to be regarded as relative fertilities
since the chance of death of an individual in $(t, t + dt)$ must be independent of its
age for this model. Then

(29)  $p_{ij}(t + dt) = (1 - dt)\, p_{ij}(t) + dt\, p_i(t) p_j(t) W(t)^{-2} \sum_k w_{ik} p_k(t) \sum_l w_{lj} p_l(t).$

The  appropriate  differential  equation  is  then 

(30)  $\frac{dp_{ij}(t)}{dt} = -p_{ij}(t) + p_i(t) p_j(t) W(t)^{-2} \sum_k w_{ik} p_k(t) \sum_l w_{lj} p_l(t).$

At $t = 0$, $p_{ij}(t) = p_i(t) p_j(t)$, and if this is to remain true for $t > 0$, we must
have

(31)  $\frac{dp_{ij}(t)}{dt} = p_i(t) \frac{dp_j(t)}{dt} + p_j(t) \frac{dp_i(t)}{dt}$
      $= p_i(t) p_j(t) \left\{ -2 + W(t)^{-1} \sum_k w_{ik} p_k(t) + W(t)^{-1} \sum_l w_{lj} p_l(t) \right\}.$

Equating this to (30), replacing $p_{ij}(t)$ by $p_i(t) p_j(t)$, and assuming the latter to be
nonzero, we get

(32)  $\left( W(t) - \sum_k w_{ik} p_k(t) \right) \left( W(t) - \sum_l w_{lj} p_l(t) \right) = 0,$

which implies that either $dp_i(t)/dt = 0$, or $dp_j(t)/dt = 0$, which is not in general
true unless all the $w_{ij}$ are equal, or the population is stationary. The population
will tend to the same stationary point as the two previous models, and the $p_i(t)$
will have the values obtained by maximizing $\sum w_{ij} p_i(t) p_j(t)$. The true mean
fitness of the population is $\sum w_{ij} p_{ij}(t)$. At the stationary point this will have the
value previously given, but in the course of approaching this point, it may
increase or decrease. This is very simply seen to be the case if we consider a two
allele situation in which the zygotes $(AA)$, $(Aa)$, and $(aa)$ have (say) the fitnesses
1, 2, 1, and suppose that the population starts in a state in which all individuals
are $(Aa)$. The true mean fitness must then decrease. Notice also that in this
continuous time model “random mating” does not imply that the frequency of a
zygote is equal to the product of the frequencies of the gametes which entered
into its formation, except when the state of the population is stationary.
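
The two allele example with fitnesses 1, 2, 1 can be followed numerically. The sketch below is one possible reading of the zygote model: in each small time step a fraction of the zygotes dies and is replaced by offspring formed by random union of the gametes that the current zygotes produce in proportion to their fitnesses. The Euler step size, the time horizon, and the function names are assumptions made for the illustration.

```python
def true_mean_fitness(state, w):
    """Sum of fitness times zygote frequency for (AA, Aa, aa)."""
    return sum(wi * si for wi, si in zip(w, state))


def gamete_frequency_of_A(state, w):
    """Frequency of A among gametes, weighting zygotes by their fitness."""
    p_AA, p_Aa, p_aa = state
    w_AA, w_Aa, w_aa = w
    return (w_AA * p_AA + 0.5 * w_Aa * p_Aa) / true_mean_fitness(state, w)


def euler_step(state, w, dt):
    """A fraction dt of zygotes dies and is replaced by random-mating offspring."""
    g = gamete_frequency_of_A(state, w)
    offspring = (g * g, 2.0 * g * (1.0 - g), (1.0 - g) * (1.0 - g))
    return tuple((1.0 - dt) * s + dt * o for s, o in zip(state, offspring))


w = (1.0, 2.0, 1.0)              # fitnesses of AA, Aa, aa from the text
state = (0.0, 1.0, 0.0)          # all individuals are Aa at t = 0
dt = 0.01                        # illustrative step size

fitness_path = [true_mean_fitness(state, w)]
for _ in range(2000):            # integrate up to t = 20
    state = euler_step(state, w, dt)
    fitness_path.append(true_mean_fitness(state, w))

print("true mean fitness at t = 0:", fitness_path[0])
print("true mean fitness at t = 20:", fitness_path[-1])
```

Under these assumptions the true mean fitness starts at 2 and falls steadily towards 1.5, the value at the stationary point $p = 1/2$, while the gene frequency itself never changes.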

Before  leaving  the  subject  of  selection  dependent  on  a  single  locus  we  mention 
two further generalizations. We have so far assumed that the $w_{ij}$ are constants
independent  of  the  gene  frequencies.  It  is  easy  to  imagine  plausible  situations  in 
which  this  is  not  so.  The  mean  fitness  may  then  decrease,  and  the  theory  is 
naturally  more  complicated.  Some  examples  are  given  in  Moran  [14]. 

Another  restriction  that  may  be  relaxed  is  that  of  random  mating.  One 
alternative  is  “assortative”  mating.  This  occurs  when  there  is  a  tendency  for 
some  types  of  mating  to  have  a  greater  or  lower  probability  than  the  product  of 
the frequencies of the types involved. When individuals have an increased
probability of mating with other individuals of the same genotype, the assortment is
said  to  be  positive,  and  in  the  reverse  case  negative.  In  deterministic  models  the 
resulting  theory  is  complicated  but  not  too  difficult.  The  effect  of  negative 
assortative  mating  is  quite  different  from  that  of  positive  assortative  mating  in 
that  the  latter  does  not  tend  to  alter  the  gene  frequencies  whereas  the  former 
has  the  effect  of  forcing  the  gene  frequencies  (in  the  two  allele  case)  towards  the 
value 1/2. Thus, negative assortative mating results in a pseudo-selective
heterozygote advantage which can produce a permanent polymorphism. This is really
the  process  which  keeps  the  sex  ratio  at  conception  near  unity.  In  stochastic 
models  assortative  mating,  especially  when  negative,  is  much  more  difficult  to 
deal  with  (Watterson  [23]). 

Another  subject  which  might  repay  mathematical  analysis  is  the  evolution  of 
genetic  populations  which  depend  on  each  other  by,  for  example,  competition, 
predation,  or  parasitism.  This  would  be  of  interest  in  the  study  of  the  evolution 
of  disease. 


6.  Selection  dependent  on  several  loci 

We now turn to the case of more than one locus. Suppose that there are $k > 1$
loci at each of which there are two alleles, $A_i$ and $a_i$ $(i = 1, \ldots, k)$. These loci
need not all be on the same chromosome, but for a general theory we may
imagine  them  all  on  a  single  chromosome  separated,  if  necessary,  into  groups  so 
far  apart  that  there  is  so  much  crossing  over  between  them  that  they  behave  as 
if  they  were  on  different  chromosomes. 

The  restriction  to  two  alleles  at  each  locus  is  not  important,  for  two  loci  at 
which there are alleles $A_1$, $a_1$, and $A_2$, $a_2$, respectively, might be so close together
that there is no crossing over between them. They then behave as a single locus
with four alleles which can be denoted by $A_1 A_2$, $A_1 a_2$, $a_1 A_2$, and $a_1 a_2$. Similarly,
we could obtain a ‘locus’ with $2^s$ $(s > 2)$ effective alleles, and then, by
identification, a locus with any number of alleles less than $2^s$.

We denote each gamete by a vector $(x_i) = (x_1, \ldots, x_k)$, where $x_i = 0, 1$
according to whether the gamete contains $A_i$ or $a_i$. Similarly, we denote a zygote
by an ordered pair of such vectors $(x_i | x_i') = (x_1, \ldots, x_k | x_1', \ldots, x_k')$ where it is
again algebraically convenient to distinguish such a zygote from
$(x_i' | x_i) = (x_1', \ldots, x_k' | x_1, \ldots, x_k)$ by some conventional distinction (such as the origin of
the constituent gametes from father or mother) which ensures that the
frequencies of such pairs of zygotes are equal.

The fitnesses of the zygotes are again defined by nonnegative numbers $w(x_i | x_i')$
where not only do we assume $w(x_i | x_i') = w(x_i' | x_i)$, but we shall also assume that
the $w$’s do not depend on how the alleles are distributed among the two
components; that is, we shall suppose, for example, that $w(11|00)$ is equal to $w(10|01)$.
This is equivalent to assuming that the $w$’s are functions of $(z_i) = (x_i + x_i')$
only. We shall, however, have to distinguish $(x_i)$ and $(x_i')$ when we find the
gametic output of each zygote (some evolutionary problems have been considered
in which the $w$’s depend on the $(x_i)$ and $(x_i')$ separately, but we shall not consider
this “position effect”).

Let the generations be nonoverlapping, and suppose that in the gametic output
which formed the first generation the frequency of gametes of type $(x_i) =
(x_1, \ldots, x_k)$ is $p(x_i) = p(x_1, \ldots, x_k)$. There are $2^k$ such gametes and

(33)  $\sum_{x_i} p(x_1, \ldots, x_k) = 1.$

These form zygotes by random mating so that the frequency of the zygote
$(x_i|x_i')$ can be written as

(34)  $p(x_i|x_i') = p(x_i)p(x_i').$

There are $2^{2k}$ such zygotes, $2^k$ of which are such that $x_i = x_i'$ for all $i$. These
are the “homozygotes,” and the remainder are “heterozygotic” in respect of one
or more loci, and are identifiable in pairs with the same frequencies.

We now have to consider the manner in which the zygotes produce the gametes
which form the next generation. The number of gametes they produce will be
proportional to the corresponding $w$'s, but now that we have more than one
locus, we have to take account of the possibility of “recombination” due to
“crossing over.” Suppose that the loci are arranged in order on the chromosome
at points $P_1, \ldots, P_k$. Recombination will occur between $P_1$ and $P_2$ if an odd
number of crossovers occur between them. Since there are $k - 1$ gaps between
the loci, recombination can occur in $2^{k-1}$ ways (including the case of no
recombination). The probabilities with which these occur will depend on the positions
of the loci and their distances apart. An elaborate theory (Bailey [2]) exists for
this dependence, but is not relevant here.

Suppose that an odd number of crossovers occur in each of the gaps $j_1, j_2,
\ldots, j_s$ ($j_i = 1, \ldots, k - 1$), and zero or an even number in the remaining gaps.
For the probability of this occurring we shall write

(35)  $2R(0, 0, 0, \ldots, 0, 1, 1, 1, 0, \ldots) = 2R(1, 1, 1, \ldots, 1, 0, 0, 0, 1, \ldots),$

there being $k$ symbols equal to 0 or 1, and a change in the sequence occurring
between the $s$th and $(s + 1)$th symbol if an odd number of crossovers occurs in
the $s$th gap. We write this as $2R(\delta_1, \ldots, \delta_k)$ where $\delta_i = 0, 1$, and we have made
this equal to $2R(1 - \delta_1, \ldots, 1 - \delta_k)$ in order to ensure symmetry in the following
equations. We then have

(36)  $\sum_{\delta_i} R(\delta_1, \ldots, \delta_k) = 1,$

where the summation is taken over all sequences of $k$ zeros and ones.

If a recombination of type $(\delta_1, \ldots, \delta_k)$ occurs in the production of gametes
from a zygote of type $(x_1, \ldots, x_k|x_1', \ldots, x_k')$, the gametic output will consist
of equal numbers of gametes of the form

(37)  $((1 - \delta_1)x_1 + \delta_1 x_1', \ldots, (1 - \delta_k)x_k + \delta_k x_k'),$

and of the form

(38)  $(\delta_1 x_1 + (1 - \delta_1)x_1', \ldots, \delta_k x_k + (1 - \delta_k)x_k').$

Thus, zygotes of type $(x_1, \ldots, x_k|x_1', \ldots, x_k')$ will have a total gametic output
such that the numbers of gametes of type (38) will be proportional to

(39)  $w(x_1, \ldots, x_k|x_1', \ldots, x_k')R(\delta_1, \ldots, \delta_k).$

The frequency of such zygotes is

(40)  $p(x_1, \ldots, x_k|x_1', \ldots, x_k') = p(x_1, \ldots, x_k)p(x_1', \ldots, x_k'),$

which we write as $p(x_i|x_i') = p(x_i)p(x_i')$. Thus, finally the total number of
gametes of type $(y_1, \ldots, y_k)$ contributing to the next generation will be
proportional to

(41)  $\sum_{\delta_i} \sum'_{x_i} \sum'_{x_i'} R(\delta_1, \ldots, \delta_k)w(x_i|x_i')p(x_i)p(x_i'),$

the first sum being over all sequences $(\delta_1, \ldots, \delta_k)$, and the sums over $x_i$, $x_i'$ being
taken over all values such that $\delta_i x_i + (1 - \delta_i)x_i' = y_i$, the fact that this is a
conditional sum being denoted by a dash. If we sum (41) over all possible sets
$(y_1, \ldots, y_k)$, we get

(42)  $W = \sum_{x_i} \sum_{x_i'} w(x_i|x_i')p(x_i)p(x_i'),$

the summation over the $x_i$ and $x_i'$ being now unrestricted. This is the mean
fitness of the parent population.

Denote the frequency of the gamete $(y_1, \ldots, y_k)$ entering into the formation
of the next generation by $p'(y_1, \ldots, y_k)$. We now have

(43)  $p'(y_1, \ldots, y_k) = W^{-1} \sum_{\delta_i} \sum'_{x_i} \sum'_{x_i'} R(\delta_1, \ldots, \delta_k)w(x_i|x_i')p(x_i)p(x_i').$

These  are  the  fundamental  equations  of  evolutionary  theory  assuming  random 
mating  and  fitnesses  independent  of  gene  frequencies. 
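As an illustration only, the recurrence (43) can be iterated numerically along the following lines. This is a minimal sketch, not part of the original derivation: the function names (`next_generation`, `two_locus_R`, `symmetric_fitness`), the dictionary representation of $p$, and the use of exact rational arithmetic are all assumptions made for the example. The encoding $x_i = 0$ for $A_i$ and $x_i = 1$ for $a_i$ follows the text.

```python
from fractions import Fraction
from itertools import product


def next_generation(p, w, R, k):
    """Apply recurrence (43) once.

    p: dict mapping gamete tuples (x_1, ..., x_k) of 0/1 to frequencies.
    w: function w(x, xp) giving the fitness of the zygote (x | xp).
    R: function R(delta) with R(delta) == R(complement of delta) and
       summing to 1 over all 2**k sequences, as in (35)-(36).
    Returns the new gamete frequencies and the mean fitness W of (42).
    """
    gametes = list(product((0, 1), repeat=k))
    unnormalised = {y: 0 for y in gametes}
    for delta in gametes:
        for x in gametes:
            for xp in gametes:
                # Gamete of the form (38): x_i where delta_i = 1, else x'_i.
                y = tuple(xi if d else xpi for d, xi, xpi in zip(delta, x, xp))
                unnormalised[y] += R(delta) * w(x, xp) * p[x] * p[xp]
    W = sum(unnormalised.values())  # equals (42) because R sums to 1
    return {y: v / W for y, v in unnormalised.items()}, W


def two_locus_R(r):
    """Hypothetical helper: R for k = 2 with recombination probability r."""
    half = Fraction(1, 2)

    def R(delta):
        return half * r if delta[0] != delta[1] else half * (1 - r)

    return R


def symmetric_fitness(a):
    """Hypothetical helper: fitness depending only on z_i = x_i + x'_i."""
    table = [[1, 2, 1], [2, a, 2], [1, 2, 1]]

    def w(x, xp):
        return table[x[1] + xp[1]][x[0] + xp[0]]

    return w
```

For $k = 2$ the triple loop reduces to the four equations for two loci; for larger $k$ the cost grows as $8^k$, so the sketch is only practical for a handful of loci.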

Two extreme cases are of particular interest. First, we may suppose that the
loci are so closely linked that no recombination at all is possible. We then have
$R(0, \ldots, 0) = R(1, \ldots, 1) = 1/2$, and all the other values of $R$ are zero.
Equality (43) becomes

(44)  $p'(y_1, \ldots, y_k) = W^{-1} \sum_{x_i} w(x_i|y_i)p(x_i)p(y_i) = \tfrac{1}{2} p(y_i) W^{-1} \, \partial W/\partial p(y_i).$

These equations are now the same as those considered in section 4, as is otherwise
obvious from the fact that we may consider the population to be equivalent to
one in which fitness is determined by a single locus at which there are $2^k$ alleles.
Thus, in this case, the population moves to the unique state at which fitness is
maximized, if the stability conditions referred to before are satisfied.

The other extreme case is that of independent assortment. Here we put
$R(\delta_1, \ldots, \delta_k) = 2^{-k}$ so that

(45)  $p'(y_1, \ldots, y_k) = 2^{-k}W^{-1} \sum^{*}_{x_i, x_i'} w(x_i|x_i')p(x_i)p(x_i'),$

where the sum $\sum^{*}$ is taken over all sets $(x_i)$, $(x_i')$ such that $(y_i)$ is obtained by a
specified sequence of choices of the pairs $x_i$ and $x_i'$. The theory of these equations
is much less simple than that of (44).

7.  Selection  dependent  on  two  loci 

In an attempt to understand these equations better, we now confine ourselves
to the case where selection is dependent on only two loci. Furthermore, since the
notation used above is clumsy for only two loci, we change the notation to agree
with that of a previous paper (Moran [16]), which is similar to that used by
other writers (for example, Bodmer and Parsons [5]).

We suppose there are two loci at which the possible alleles are $A$ and $a$, $B$ and
$b$, respectively. The possible genotypes of the zygotes can then be arranged in
the array

          AABB   AaBB   aaBB
(46)      AABb   AaBb   aaBb
          AAbb   Aabb   aabb

whose corresponding fitnesses we now denote by

(47)  $\begin{matrix} w_{11} & w_{12} & w_{13} \\ w_{21} & w_{22} & w_{23} \\ w_{31} & w_{32} & w_{33}. \end{matrix}$

Let $c_1$, $c_2$, $c_3$, $c_4$ be the relative frequencies of the gametes $AB$, $aB$, $Ab$, $ab$ entering
into the formation of the first (parental) generation. Then the frequencies of the
above zygotes in this generation will be given by the array

(48)  $\begin{matrix} c_1^2 & 2c_1c_2 & c_2^2 \\ 2c_1c_3 & 2c_1c_4 + 2c_2c_3 & 2c_2c_4 \\ c_3^2 & 2c_3c_4 & c_4^2, \end{matrix}$

and the mean fitness $W$ of this generation will be obtained by multiplying each
of these frequencies by the corresponding $w_{ij}$ and summing. Notice that $AaBb$
is ascribed the same fitness whether it is a “coupling” zygote ($AB/ab$), or a
“repulsion” zygote ($Ab/aB$). When this is not true, the theory is less simple
(Parsons [19]).

Let $R$ be the probability of an odd number of crossovers between the two loci.
Then $0 \le R \le 1$, and for the most part, we can also assume $R < 1/2$. Enumerating
the terms of the sums in (43) we easily find the following equations for the
frequencies $(c_1', c_2', c_3', c_4')$ in the next generation:


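(49)  $c_1'W = w_{11}c_1^2 + w_{12}c_1c_2 + w_{21}c_1c_3 + w_{22}c_1c_4 + Rw_{22}(c_2c_3 - c_1c_4)$
             $= \tfrac{1}{2} c_1 \, \partial W/\partial c_1 + Rw_{22}(c_2c_3 - c_1c_4);$

(50)  $c_2'W = w_{12}c_1c_2 + w_{13}c_2^2 + w_{22}c_2c_3 + w_{23}c_2c_4 - Rw_{22}(c_2c_3 - c_1c_4)$
             $= \tfrac{1}{2} c_2 \, \partial W/\partial c_2 - Rw_{22}(c_2c_3 - c_1c_4);$

(51)  $c_3'W = w_{21}c_1c_3 + w_{22}c_2c_3 + w_{31}c_3^2 + w_{32}c_3c_4 - Rw_{22}(c_2c_3 - c_1c_4)$
             $= \tfrac{1}{2} c_3 \, \partial W/\partial c_3 - Rw_{22}(c_2c_3 - c_1c_4);$

(52)  $c_4'W = w_{22}c_1c_4 + w_{23}c_2c_4 + w_{32}c_3c_4 + w_{33}c_4^2 + Rw_{22}(c_2c_3 - c_1c_4)$
             $= \tfrac{1}{2} c_4 \, \partial W/\partial c_4 + Rw_{22}(c_2c_3 - c_1c_4).$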


The quantities $c_1$, $c_2$, $c_3$, $c_4$ enter into these equations in a symmetrical manner.
Notice that this symmetry is not described by the full permutation group on four
symbols but by a permutation group of order 8.

In these equations it is convenient to remember that $W$ is a quadratic form
in the four variables $(c_1, c_2, c_3, c_4)$ with matrix

(53)  $\begin{pmatrix} w_{11} & w_{12} & w_{21} & w_{22} \\ w_{12} & w_{13} & w_{22} & w_{23} \\ w_{21} & w_{22} & w_{31} & w_{32} \\ w_{22} & w_{23} & w_{32} & w_{33} \end{pmatrix}.$

Equations  (49)-(52)  were  derived  by  Lewontin  and  Kojima  [11]  (see  also 
Kimura  [8]). 
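The two-locus recurrence (49)-(52) is easy to iterate numerically. The following is a minimal sketch under stated assumptions: the names `fitness_matrix`, `mean_fitness`, and `step` are hypothetical, the fitness array (47) is passed as a 3x3 nested list, and the gamete frequencies are ordered $(c_1, c_2, c_3, c_4)$ for $AB$, $aB$, $Ab$, $ab$ as in the text.

```python
from fractions import Fraction


def fitness_matrix(w):
    """Matrix (53) built from the array (47); w[i][j] holds w_{i+1, j+1}."""
    return [
        [w[0][0], w[0][1], w[1][0], w[1][1]],
        [w[0][1], w[0][2], w[1][1], w[1][2]],
        [w[1][0], w[1][1], w[2][0], w[2][1]],
        [w[1][1], w[1][2], w[2][1], w[2][2]],
    ]


def mean_fitness(c, w):
    """W as the quadratic form in (c1, c2, c3, c4) with matrix (53)."""
    M = fitness_matrix(w)
    return sum(c[i] * M[i][j] * c[j] for i in range(4) for j in range(4))


def step(c, w, R):
    """Apply (49)-(52) once and return (c1', c2', c3', c4')."""
    M = fitness_matrix(w)
    W = mean_fitness(c, w)
    D = c[1] * c[2] - c[0] * c[3]          # c2 c3 - c1 c4
    sign = (1, -1, -1, 1)                  # sign of the R term in (49)-(52)
    return tuple(
        (c[i] * sum(M[i][j] * c[j] for j in range(4)) + sign[i] * R * w[1][1] * D) / W
        for i in range(4)
    )
```

Passing `Fraction` values keeps the arithmetic exact, which is convenient when checking whether a candidate point is stationary; floating-point inputs work equally well for long iterations.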

If $R > 0$, the maximization of $W$ subject to $\sum c_i = 1$ does not give equations
equivalent to those obtained for internal stationary points by putting $c_i' = c_i$ in
(49)-(52). In fact, to maximize $W$ with $\sum c_i = 1$ is equivalent to maximizing
$W/(\sum c_i)^2$ for unrestricted nonnegative $c_i$, and then rescaling the $c_i$ if necessary.
This leads to the equations

(54)  $\partial W/\partial c_i = 2W, \quad i = 1, \ldots, 4$

for an internal point. If $R > 0$, these equations are not consistent with (49)-(52)
unless $c_2c_3 = c_1c_4$, which is not in general true. Thus the stationary points of the
system defined by (49)-(52) are not found by maximizing $W$, and it can be shown
(Moran [16]) that $W$ can decrease from one generation to the next even when
each is produced by random mating.
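The inconsistency can be seen in two lines. The following worked step is a sketch that assumes $w_{22} > 0$, which the text does not state explicitly.

```latex
\begin{aligned}
&\frac{\partial}{\partial c_i}\,\frac{W}{(\sum_j c_j)^2}
  = \frac{1}{(\sum_j c_j)^2}\frac{\partial W}{\partial c_i}
    - \frac{2W}{(\sum_j c_j)^3} = 0
  \;\Longrightarrow\;
  \frac{\partial W}{\partial c_i} = 2W
  \quad\text{when } \textstyle\sum_j c_j = 1, \\[4pt]
&\text{and (49) with } c_1' = c_1:\quad
  c_1 W = \tfrac12 c_1 \cdot 2W + R\,w_{22}\,(c_2c_3 - c_1c_4)
  \;\Longrightarrow\;
  R\,w_{22}\,(c_2c_3 - c_1c_4) = 0 .
\end{aligned}
```

So for $R > 0$ and $w_{22} > 0$ a point satisfying both sets of equations must have $c_2c_3 = c_1c_4$.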

Lewontin and Kojima call $(c_2c_3 - c_1c_4)$ the “coefficient of linkage disequilibrium,”
but this is misleading since it implies that the fact $c_2c_3 \ne c_1c_4$ is
dependent on the existence of linkage between the loci (namely, $R \ne 1/2$). This
is not true, and $c_2c_3 - c_1c_4$ might be better called a “coefficient of epistatic
disequilibrium.”

Wright thought that evolutionary change could be described by saying that
populations tend to move so as to maximize $W$, and his argument can be
summarized as follows. Using nonoverlapping generations and the above model it is
clear that the population at each generation satisfies the Hardy-Weinberg law
for each locus separately; that is, the frequencies of the homozygotes are the
squares of the corresponding gene frequencies. Consider what happens at the
first locus and write $p$ for the frequency of $A$ and $q = 1 - p$ for the frequency
of $a$ (thus $p = c_1 + c_3$, $q = c_2 + c_4$). Then the gene frequency $p'$ in the next
generation will be given by (10) where

(55)  $W = w_1p^2 + 2w_2pq + w_3q^2,$

(56)  $w_1 = (c_1 + c_3)^{-2} \{w_{11}c_1^2 + 2w_{21}c_1c_3 + w_{31}c_3^2\},$
      $w_2 = \{(c_1 + c_3)(c_2 + c_4)\}^{-1} \{w_{12}c_1c_2 + w_{22}(c_1c_4 + c_2c_3) + w_{32}c_3c_4\},$
      $w_3 = (c_2 + c_4)^{-2} \{w_{13}c_2^2 + 2w_{23}c_2c_4 + w_{33}c_4^2\}.$
In carrying out the differentiation in (10), the $w_i$ have to be kept fixed. Wright
thus concluded that for stationarity we must have

(57)

and this is true in the above sense when the $w_i$ are kept fixed. Since $W$ is not a
function of the gene frequencies alone, this does not imply that the internal
stationary states of the population correspond to stationary values of $W$. In
fact, Wright assumed that $W$ could be written in the form

(58)  $W = w_{11}p^2P^2 + 2w_{12}pqP^2 + w_{13}q^2P^2$
          $+ 2w_{21}p^2PQ + 4w_{22}pqPQ + 2w_{23}q^2PQ$
          $+ w_{31}p^2Q^2 + 2w_{32}pqQ^2 + w_{33}q^2Q^2,$

where $P = 1 - Q$ is the frequency of $B$; that is, $P = c_1 + c_2$. This is clearly not
correct. Plotting (58) as a function of $p$ and $P$ we obtain an “adaptive topography,”
but from what has been said above, this throws no light on the behavior
of the population.
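The difference between the true mean fitness and the form (58) can be made concrete with a small numerical comparison. This is a sketch for illustration only; the function names are hypothetical and the fitness values in the usage note are chosen arbitrarily.

```python
from fractions import Fraction


def true_mean_fitness(c, w):
    """Mean fitness from the zygote frequencies (48) and the fitnesses (47)."""
    c1, c2, c3, c4 = c
    freq = [
        [c1 * c1, 2 * c1 * c2, c2 * c2],
        [2 * c1 * c3, 2 * c1 * c4 + 2 * c2 * c3, 2 * c2 * c4],
        [c3 * c3, 2 * c3 * c4, c4 * c4],
    ]
    return sum(w[i][j] * freq[i][j] for i in range(3) for j in range(3))


def wright_mean_fitness(c, w):
    """The form (58), which uses only the gene frequencies p and P."""
    c1, c2, c3, c4 = c
    p, P = c1 + c3, c1 + c2            # frequencies of A and of B
    q, Q = 1 - p, 1 - P
    col = [p * p, 2 * p * q, q * q]    # AA, Aa, aa
    row = [P * P, 2 * P * Q, Q * Q]    # BB, Bb, bb
    return sum(w[i][j] * row[i] * col[j] for i in range(3) for j in range(3))
```

With all fitnesses equal to 1 except $w_{22} = 2$, the two functions agree at $c_i = 1/4$, where $c_2c_3 = c_1c_4$, but differ at $(1/2, 0, 0, 1/2)$, where the gene frequencies are the same and only $c_2c_3 - c_1c_4$ has changed.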

Using this incorrect theory, I attempted (Moran [15]) to discuss the problem
of determining how many different isolated stationary internal points
such  a  population  could  have,  and  how  many  of  these  could  be  stable  for  small 
perturbations.  The  condition  that  the  points  be  isolated  is  necessary  as  it  is 
trivial  to  show  that  stationary  points  can  fill  up  a  whole  interval.  The  problem 
is  then  that  of  finding  the  stationary  points  of  (58),  and  the  answer  is  that  there 
can  be  at  most  five  such  points,  and  at  most  three  local  maxima,  both  bounds 
being  attainable. 

This  theory  is  incorrect,  and  the  number  of  possible  stationary  and  stable 
points  of  the  system  (49)-(52)  is  an  open  question.  However,  it  is  possible  that 
five  and  three  are  the  correct  bounds,  and  this  is  suggested  by  the  following 
argument. 

In the single locus problem considered in sections 3 and 4, the position and
stability of the unique internal stationary point was unaltered when the $w_i$
underwent the transformation $w_i \to \lambda w_i + \mu$, where $\lambda > 0$, $\mu > -\lambda \min w_i$. On
the other hand, for the system (49)-(52) the stationary solutions are, in general,
only invariant for transformations of the form $w_{ij} \to \lambda w_{ij}$, $\lambda > 0$. However,
consider the effect of a transformation of the type $w_{ij} \to 1 + \lambda(w_{ij} - 1)$. If $\lambda \to 0$,
the new $w_{ij}$ tend to unity. It can then be easily shown that $c_2c_3 - c_1c_4 \to 0$, and
the stationary points tend in the limit to the stationary points of the function
defined by (58). This suggests that for $w_{ij}$ near unity the number of possible
stationary points remains the same, and hence, this holds true generally.
However, this has not been proved.

From equations (49)-(52) we can easily derive, as in section 5, a set of four
differential equations to describe the evolution of the population in a continuous
time model. However, these are incorrect for the same reasons as those considered
in section 5, and a full description of such a population requires the frequencies
of each of the zygotes. There are nine of these, adding to unity, and their values
do not, in general, remain equal to the values $c_1^2, 2c_1c_2, \ldots$ given in section 7. The
differential equations derived directly from (49)-(52) may, however, be regarded
as a useful approximation when the $w_{ij}$ are nearly equal.

Since the maximization of $W$ does not provide a method for determining the
stationary points, we may ask if there is any other function whose extreme values
would serve. Relatively trivial examples are easy to construct, for example, by
summing the squares of the differences of the right-hand sides of (49)-(52) from
the values $Wc_i$ ($i = 1, \ldots, 4$). This is not a quadratic form in the $c_i$, and it seems
probable that there is no such quadratic form whose extreme or stationary values
correspond to stationary states of the population. Furthermore, if a suitable
function  could  be  found,  it  would  be  desirable  that  it  should  be  nondecreasing 
from  generation  to  generation.  One  might  hope  in  this  way  to  construct  a  theory 
of  the  equations  of  evolutionary  change  analogous  to  the  general  theories  of 
dynamics. 

I  have  not  succeeded  in  constructing  such  a  function  directly  from  equations 
(49)-(52).  Professor  S.  Karlin  has,  however,  shown  that  if  we  assume  that 
iteration  of  these  recurrence  relations  always  results  in  the  state  of  the  system 
converging to a set of points $S$, then there exists a continuous function of the $c_i$
which is nondecreasing. I am indebted to him for the following proof.

Let $x$ be a vector in a bounded space with a distance function $\rho(x, y)$, and let
$x' = Tx$ be a transformation which has the property that the $n$th iterate, $T^n x$,
converges to a subset $S$; that is, $\rho(T^n x, S) \to 0$. Then there exists a continuous
function, $u(x)$, such that $u(Tx) \ge u(x)$ for all $x$. To prove this, define a continuous
function

(59)  $u_j(x) > 0$ if $\rho(x, S) > j^{-1}$,  $u_j(x) = 0$ if $\rho(x, S) \le j^{-1}$,

and then define

(60)  $u(x) = \sum_j 2^{-j} \{1 + \sum_n u_j(T^n x)\}^{-1}.$

The inner sum is really finite since $T^n x \to S$, and the outer sum is convergent.
It is then obvious that $u(Tx) \ge u(x)$.

This result assumes that we know that the iterates converge to the set $S$. This
seems highly probable in the genetic case but is not known with certainty. What
is needed is to construct a function like $u(x)$ directly from the $w_{ij}$ and $c_i$ and use
this to deduce the asymptotic behavior of $T^n x$.

8.  The  stability  of  equilibria  and  the  effects  of  mutation

Suppose that the $c_i$ are the values of a stationary internal point so that
$0 < c_i < 1$. Then equations (49)-(52) must hold with $c_i'$ replaced by $c_i$. As in
Moran [16] (see also Kimura [8]), we investigate the effect of a small perturbation
around this point by replacing the $c_i$ with $c_i + \delta_i$, where the $\delta_i$ are small.
In the next generation we will get values $c_i'$ which we put equal to $c_i + \delta_i'$, where
the $\delta_i'$ are also small. Then expanding the recurrence relation around the point
$(c_1, c_2, c_3, c_4)$ and taking the linear terms only, we get

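(61)  $\delta_i' = \sum_j a_{ij} \delta_j,$

where $(a_{ij})$ is a matrix such that

(62)  $a_{ii} = W^{-1} \{ \tfrac{1}{2} \, \partial W/\partial c_i + \tfrac{1}{2} c_i \, \partial^2 W/\partial c_i^2 + Rw_{22}k(i) \, \partial D/\partial c_i - c_i \, \partial W/\partial c_i \},$

and if $i \ne j$,

(63)  $a_{ij} = W^{-1} \{ \tfrac{1}{2} c_i \, \partial^2 W/\partial c_i \partial c_j + Rw_{22}k(i) \, \partial D/\partial c_j - c_i \, \partial W/\partial c_j \},$

where $k(i) = +1$ if $i = 1, 4$, and where $k(i) = -1$ if $i = 2, 3$, and $D$ is defined
as $c_2c_3 - c_1c_4$.

It is easy to verify that $\sum_i a_{ij} = 0$, and therefore $\sum_i \delta_i' = 0$, as we expect.
The matrix, therefore, has one root equal to zero, and the state will be stable if
the other three roots are less than unity in modulus.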
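The matrix $(a_{ij})$ can be assembled directly from (62)-(63), using $\partial W/\partial c_j = 2(Mc)_j$ and $\partial^2 W/\partial c_i \partial c_j = 2M_{ij}$, where $M$ is the matrix (53). The sketch below is illustrative only: the names `fitness_matrix`, `step`, and `jacobian` are hypothetical, and the eigenvalue computation needed for the stability test is left to whatever numerical library is at hand.

```python
def fitness_matrix(w):
    """Matrix (53) built from the 3x3 fitness array (47)."""
    return [
        [w[0][0], w[0][1], w[1][0], w[1][1]],
        [w[0][1], w[0][2], w[1][1], w[1][2]],
        [w[1][0], w[1][1], w[2][0], w[2][1]],
        [w[1][1], w[1][2], w[2][1], w[2][2]],
    ]


def step(c, w, R):
    """One generation of (49)-(52), used here only to check the linearisation."""
    M = fitness_matrix(w)
    Mc = [sum(M[i][j] * c[j] for j in range(4)) for i in range(4)]
    W = sum(c[i] * Mc[i] for i in range(4))
    D = c[1] * c[2] - c[0] * c[3]
    k = (1, -1, -1, 1)
    return [(c[i] * Mc[i] + k[i] * R * w[1][1] * D) / W for i in range(4)]


def jacobian(c, w, R):
    """Matrix (a_ij) of (62)-(63), evaluated at a stationary point c."""
    M = fitness_matrix(w)
    Mc = [sum(M[i][j] * c[j] for j in range(4)) for i in range(4)]
    W = sum(c[i] * Mc[i] for i in range(4))
    k = (1, -1, -1, 1)                      # k(i) as defined after (63)
    dD = (-c[3], c[2], c[1], -c[0])         # gradient of D = c2 c3 - c1 c4
    a = [[0.0] * 4 for _ in range(4)]
    for i in range(4):
        for j in range(4):
            # (1/2) c_i d2W/dc_i dc_j = c_i M_ij,  c_i dW/dc_j = 2 c_i (Mc)_j
            a[i][j] = (c[i] * M[i][j] + R * w[1][1] * k[i] * dD[j]
                       - 2 * c[i] * Mc[j]) / W
        a[i][i] += Mc[i] / W                # (1/2) dW/dc_i, diagonal term of (62)
    return a
```

A quick sanity check is that every column of the returned matrix sums to zero, in agreement with $\sum_i a_{ij} = 0$.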

Since in general there are neighboring points to a stationary point $(c_i)$ at
which $W$ is both greater and less than its value at $(c_i)$, we might expect that the
effect  of  mutation,  which  will  shift  the  stationary  point  somewhat,  could  either 
increase  or  decrease  the  fitness.  Consider  the  effect  of  such  mutation  at  a  stable 
point. 

Suppose that $A$ mutates to $a$ at a rate $\alpha_1$ per locus per generation, and $a$ to
$A$, $B$ to $b$, $b$ to $B$ similarly at rates $\alpha_2$, $\beta_1$, and $\beta_2$, respectively. If the $(c_i)$ are the
initial frequencies of the gametes, their frequencies after mutation will be

(64)  $c_1 + \epsilon_1 = (1 - \alpha_1)(1 - \beta_1)c_1 + \alpha_2(1 - \beta_1)c_2 + (1 - \alpha_1)\beta_2c_3 + \alpha_2\beta_2c_4,$

(65)  $c_2 + \epsilon_2 = \alpha_1(1 - \beta_1)c_1 + (1 - \alpha_2)(1 - \beta_1)c_2 + \alpha_1\beta_2c_3 + (1 - \alpha_2)\beta_2c_4,$

(66)  $c_3 + \epsilon_3 = (1 - \alpha_1)\beta_1c_1 + \alpha_2\beta_1c_2 + (1 - \alpha_1)(1 - \beta_2)c_3 + \alpha_2(1 - \beta_2)c_4,$

(67)  $c_4 + \epsilon_4 = \alpha_1\beta_1c_1 + (1 - \alpha_2)\beta_1c_2 + \alpha_1(1 - \beta_2)c_3 + (1 - \alpha_2)(1 - \beta_2)c_4.$


The $\alpha_i$ and $\beta_i$ being very small, we may neglect terms of higher order than the
first and replace the above equations by

(68)  $\epsilon_1 = -(\alpha_1 + \beta_1)c_1 + \alpha_2c_2 + \beta_2c_3,$

(69)  $\epsilon_2 = \alpha_1c_1 - (\alpha_2 + \beta_1)c_2 + \beta_2c_4,$

(70)  $\epsilon_3 = \beta_1c_1 - (\alpha_1 + \beta_2)c_3 + \alpha_2c_4,$

(71)  $\epsilon_4 = \beta_1c_2 + \alpha_1c_3 - (\alpha_2 + \beta_2)c_4.$
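The exact transformation (64)-(67) and its first-order form (68)-(71) can be compared numerically. The following is a minimal sketch; the function names `mutate` and `first_order_shift` and the argument names `a1`, `a2`, `b1`, `b2` (standing for $\alpha_1$, $\alpha_2$, $\beta_1$, $\beta_2$) are assumptions made for the example.

```python
from fractions import Fraction


def mutate(c, a1, a2, b1, b2):
    """Gamete frequencies after mutation, equations (64)-(67)."""
    c1, c2, c3, c4 = c
    return (
        (1 - a1) * (1 - b1) * c1 + a2 * (1 - b1) * c2 + (1 - a1) * b2 * c3 + a2 * b2 * c4,
        a1 * (1 - b1) * c1 + (1 - a2) * (1 - b1) * c2 + a1 * b2 * c3 + (1 - a2) * b2 * c4,
        (1 - a1) * b1 * c1 + a2 * b1 * c2 + (1 - a1) * (1 - b2) * c3 + a2 * (1 - b2) * c4,
        a1 * b1 * c1 + (1 - a2) * b1 * c2 + a1 * (1 - b2) * c3 + (1 - a2) * (1 - b2) * c4,
    )


def first_order_shift(c, a1, a2, b1, b2):
    """The increments of (68)-(71), linear in the mutation rates."""
    c1, c2, c3, c4 = c
    return (
        -(a1 + b1) * c1 + a2 * c2 + b2 * c3,
        a1 * c1 - (a2 + b1) * c2 + b2 * c4,
        b1 * c1 - (a1 + b2) * c3 + a2 * c4,
        b1 * c2 + a1 * c3 - (a2 + b2) * c4,
    )
```

The exact frequencies still sum to unity, the first-order increments sum to zero, and the two differ only by products of two mutation rates.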

To find the new stationary position we use (49)-(52) with $c_1', \ldots, c_4'$ replaced
by $c_1, \ldots, c_4$ on the left-hand sides, and $c_1, \ldots, c_4$ on the right-hand sides
replaced by $c_1 + \epsilon_1, \ldots, c_4 + \epsilon_4$, and then solve for the $c_i$. The new values of the
$c_i$ will differ from the previous stationary values by quantities $\eta_1$, $\eta_2$, $\eta_3$, $\eta_4$, say,
and the effect on the fitness is to add to it an amount

(72)  $\sum_i \eta_i \, \partial W/\partial c_i,$

where the derivatives are taken at the previous stationary stable point.

The $\eta_i$ will in general be of the same order as the mutation rates, and if $W$ is
not at a stationary point, the effect on $W$ will be of the same order as these
mutation rates. This is in sharp contrast to the situation for a single locus where
the effect of mutation is of the second order in the mutation rates if the original
stationary point was internal, and of the first order only when the original
position was at the boundaries. Furthermore, it is now no longer obvious that
the effect of mutation is to decrease the fitness, for $W$ is not in general a maximum
value. If the transformation (64)-(67) were a perfectly general one, it seems that
the $c_i$ could be displaced in any direction (consistent with $\sum c_i = 1$), and $W$
could therefore be decreased or increased. We would then have the intriguing
conclusion that the mean fitness of a population could be increased by mutation.
However, the transformation (64)-(67) is not a general one, since it depends on
only four constants instead of twelve (twelve because $\sum \epsilon_i$ must equal zero).
Thus it may be true that this special type of perturbation can only result in a
decrease in $W$, but this has not been proved.


9.  What  is  epistasis? 

Epistasis  may  be  said  to  occur  in  the  joint  effect  of  two  loci  if  the  effect  of 
varying  the  genotype  at  one  locus  depends  on  the  genotype  at  the  other.  This  is 
an  ambiguous  definition,  particularly  when  the  effect  concerned  is  the  fitness  of 
the joint genotype. Thus suppose first that $w_{ij}$ can be written as $u_i + v_j$, where
$u_i$ ($i = 1, 2, 3$) depends on the state at the first locus and $v_j$ on the state at the
second. The effects of the two loci may then be described as “additive,” and if
this  condition  does  not  occur,  we  could  say  that  there  is  epistasis  between  the 
two  loci. 

Alternatively, we might define the absence of epistasis by putting $w_{ij} = u_iv_j$
where $u_i$ and $v_j$ are defined in a similar way. We shall call this the “multiplicative”
case. Since “fitness” is defined to be proportional to the expected number
of gametes that a newly formed zygote of the specified type will contribute to
the next generation, it appears at first sight that the behavior of the gene
frequencies at the two loci in the multiplicative case should be independent of each
other so that epistasis is best defined as the absence of the condition $w_{ij} = u_iv_j$.

This is not correct, as we shall now see in investigating some of the consequences
of assuming that in (49)-(52) the $w_{ij}$ are one of these two forms.

We suppose $u_1 < u_2 > u_3$, $v_1 < v_2 > v_3$, and consider separately the two cases
$w_{ij} = u_i + v_j$ (“additive”), and $w_{ij} = u_iv_j$ (“multiplicative”). Write

(73)  $P = 1 - Q = \dfrac{u_2 - u_3}{2u_2 - u_1 - u_3},$

(74)  $p = 1 - q = \dfrac{v_2 - v_3}{2v_2 - v_1 - v_3}.$
Then the set of values

(75)  $(c_1 = Pp, \; c_2 = Pq, \; c_3 = Qp, \; c_4 = Qq)$

is a solution of (49)-(52) for both the additive and multiplicative case. This is an
internal point and for it $c_2c_3 - c_1c_4 = 0$, so that the value of $R$ is not relevant.
In the additive case, equations (49)-(52) can be written as follows:

(76)  $c_1'W = c_1\{u_1c_1 + u_1c_2 + u_2c_3 + u_2c_4\} + c_1\{v_1c_1 + v_2c_2 + v_1c_3 + v_2c_4\}$
             $+ R(u_2 + v_2)(c_2c_3 - c_1c_4),$

(77)  $c_2'W = c_2\{u_1c_1 + u_1c_2 + u_2c_3 + u_2c_4\} + c_2\{v_2c_1 + v_3c_2 + v_2c_3 + v_3c_4\}$
             $- R(u_2 + v_2)(c_2c_3 - c_1c_4),$

(78)  $c_3'W = c_3\{u_2c_1 + u_2c_2 + u_3c_3 + u_3c_4\} + c_3\{v_1c_1 + v_2c_2 + v_1c_3 + v_2c_4\}$
             $- R(u_2 + v_2)(c_2c_3 - c_1c_4),$

(79)  $c_4'W = c_4\{u_2c_1 + u_2c_2 + u_3c_3 + u_3c_4\} + c_4\{v_2c_1 + v_3c_2 + v_2c_3 + v_3c_4\}$
             $+ R(u_2 + v_2)(c_2c_3 - c_1c_4),$

where

(80)  $W = u_1(c_1 + c_2)^2 + 2u_2(c_1 + c_2)(c_3 + c_4) + u_3(c_3 + c_4)^2$
          $+ v_1(c_1 + c_3)^2 + 2v_2(c_1 + c_3)(c_2 + c_4) + v_3(c_2 + c_4)^2.$

From this we see that the equations for a stationary point with $R = 0$ involve
only the marginal sums $c_1 + c_2$, $c_3 + c_4$, $c_1 + c_3$, $c_2 + c_4$, that is, the gene
frequencies. It follows therefore that for $R = 0$, $(c_1 = Pp + x, \; c_2 = Pq - x, \; c_3 =
Qp - x, \; c_4 = Qq + x)$ is also a solution for any value of $x$ that makes all the $c_i$
nonnegative. Thus none of these stationary states can be strictly stable.

Notice  further  that  all  these  points  are  maxima  of  W  as  given  by  (80),  although 
they  are  not  strict  local  maxima.  For  R  =  0,  W  is  nondecreasing  from  one 
generation  to  the  next  as  follows  from  the  case  of  one  locus  with  four  alleles. 

If $R > 0$, we can prove that there is only one internal point which is stationary,
and this must be given by (75). To do this, put $c_i' = c_i$ in (76)-(79) and divide
by $c_1$, $c_2$, $c_3$, $c_4$, respectively, since these are nonzero. Subtracting (77) and (78)
from the sum of (76) and (79), we get

(81)  $R(u_2 + v_2)(c_2c_3 - c_1c_4)(c_1^{-1} + c_2^{-1} + c_3^{-1} + c_4^{-1}) = 0,$

from which it follows that $c_2c_3 = c_1c_4$. If the latter is true, we can put $c_1 = xy$,
$c_2 = x(1 - y)$, $c_3 = (1 - x)y$, $c_4 = (1 - x)(1 - y)$, and the only values of $x$ and
$y$ satisfying (76)-(79) will be found to be $x = P$, $y = p$.

It is not known whether this solution is globally stable, namely that all internal
points correspond to populations which converge to (75), although local stability
could be investigated by using the criterion of section 8. It might seem that a
proof of stability would be easily carried out along the lines of the single locus
case, but this is not true. It is in general not true that the expression $(c_2c_3 - c_1c_4)$
decreases in absolute value from generation to generation, nor that the gene
frequencies $c_1 + c_2$, $c_1 + c_3$ converge monotonically to their final values. These
surprising facts can be illustrated by simple numerical examples.

The multiplicative case is even less simple. Equations (49)-(52) now become
the following:

(82)  $c_1'W = c_1\{c_1u_1v_1 + c_2u_1v_2 + c_3u_2v_1 + c_4u_2v_2\} + Ru_2v_2(c_2c_3 - c_1c_4),$

(83)  $c_2'W = c_2\{c_1u_1v_2 + c_2u_1v_3 + c_3u_2v_2 + c_4u_2v_3\} - Ru_2v_2(c_2c_3 - c_1c_4),$

(84)  $c_3'W = c_3\{c_1u_2v_1 + c_2u_2v_2 + c_3u_3v_1 + c_4u_3v_2\} - Ru_2v_2(c_2c_3 - c_1c_4),$

(85)  $c_4'W = c_4\{c_1u_2v_2 + c_2u_2v_3 + c_3u_3v_2 + c_4u_3v_3\} + Ru_2v_2(c_2c_3 - c_1c_4).$

The set of values (75) is again a solution of these equations but may or may
not be stable. We may investigate some of the possible forms of behavior by
considering the simple symmetric scheme of fitnesses given by

(86)  $\begin{matrix} 1 & 2 & 1 \\ 2 & a & 2 \\ 1 & 2 & 1 \end{matrix}$

where $a > 0$. Lewontin and Kojima [11] and Parsons [20] have also studied
special cases like this. The case $a = 3$ is “additive” and $a = 4$ is “multiplicative.”
Equations (49)-(52) then become

(87)  $Wc_1' = c_1^2 + 2c_1c_2 + 2c_1c_3 + ac_1c_4 + Ra(c_2c_3 - c_1c_4),$

(88)  $Wc_2' = 2c_1c_2 + c_2^2 + ac_2c_3 + 2c_2c_4 - Ra(c_2c_3 - c_1c_4),$

(89)  $Wc_3' = 2c_1c_3 + ac_2c_3 + c_3^2 + 2c_3c_4 - Ra(c_2c_3 - c_1c_4),$

(90)  $Wc_4' = ac_1c_4 + 2c_2c_4 + 2c_3c_4 + c_4^2 + Ra(c_2c_3 - c_1c_4).$


Suppose that we start from an initial state in which $c_1 = c_4$, $c_2 = c_3$. Then
in the next generation $c_1' = c_4'$, $c_2' = c_3'$ by symmetry, and furthermore,
$c_1 + c_2 = c_1 + c_3 = c_1' + c_2' = c_1' + c_3' = 1/2$. Simplifying the equations we find
$c_1' - c_2' = F(c_1 - c_2)$ where

(91)  $F = \dfrac{1 + a - 2Ra}{1 + a - 8(a - 3)c_1c_2}.$

Suppose first that $R = 0$. If $a = 3$, $F = 1$ and $c_i' = c_i$. This is the neutral
equilibrium case already considered above since if $a = 3$, the fitnesses are additive.
If $a > 3$, $F > 1$ and $c_1 = c_4$ converges to $1/2$, and $c_2 = c_3$ to zero. If $a < 3$,
then $F < 1$ and all $c_i$ converge to $1/4$. Thus, if $a < 3$ the state $c_i = 1/4$ is probably
globally stable (we have not actually proved this as we have only considered
the situation where $c_1 = c_4$, $c_2 = c_3$), while if $a > 3$, the state $c_i = 1/4$ is certainly
unstable.

Now suppose $R > 0$. If $a < 3$, we again get $F < 1$, and the state
$c_i = 1/4$ is probably globally stable. If $a > 3$, the value of $F$ depends
on $a$, and $F < 1$ or $F > 1$ according to whether $2Ra$ is greater or less
than $8(a - 3)c_1c_2$. Near the point $c_i = 1/4$, $c_1c_2$ is nearly $1/16$
and thus, if $2Ra > \tfrac{1}{2}(a - 3)$, $c_1 - c_2$ will tend to zero (in
particular, this is true for $R = 1/2$), and the point $c_i = 1/4$ is probably
locally stable (this could be investigated using the criterion of section 8).
On the other hand, if $2Ra < \tfrac{1}{2}(a - 3)$ (for instance, for $a = 4$,
$R < 1/16$), $c_1 - c_2$ will increase until $c_1c_2 = Ra\{4(a - 3)\}^{-1}$.
Beyond this point $c_1 - c_2$ will decrease. This is therefore a stationary
point and probably also a locally stable point. As an example, consider the
fitnesses (86) with $a = 4$, $R = 3/64$. Then $(c_i) = (3/8, 1/8, 1/8, 3/8)$
and $(c_i) = (1/8, 3/8, 3/8, 1/8)$ are stationary points. Thus for sufficiently
tight linkage, the point $c_i = 1/4$ is certainly unstable. For $a = 4$ and
$R = 1/2$ there is no point for which $c_1c_2 = Ra\{4(a - 3)\}^{-1}$, since
$2Ra > \tfrac{1}{2}(a - 3)$ and $c_1c_2$ is necessarily not greater than $1/16$
if $c_1 = c_4$, $c_2 = c_3$. This suggests that for $R = 1/2$ the point
$c_i = 1/4$ is locally stable (as could be investigated) and even globally
stable.
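
The location of the stationary points quoted in the example can be checked
numerically. The following is a minimal sketch, not part of the original
argument. It uses only the relation $c_1c_2 = Ra\{4(a - 3)\}^{-1}$ and the
constraint $c_1 + c_2 = 1/2$, which follows from $c_1 = c_4$, $c_2 = c_3$ and
the frequencies summing to one. The function name is hypothetical, and the
fitnesses (86) themselves are not used.

```python
import math


def symmetric_stationary_points(a, R):
    """Stationary points with c1 = c4, c2 = c3, other than c_i = 1/4.

    Solves c1*c2 = R*a / (4*(a - 3)) together with c1 + c2 = 1/2,
    i.e. the quadratic x**2 - x/2 + p = 0 in x = c1.
    Returns a list of (c1, c2, c3, c4) tuples, empty if none exist.
    """
    if a <= 3:
        raise ValueError("the relation is only stated for a > 3")
    p = R * a / (4.0 * (a - 3))   # required value of the product c1*c2
    disc = 0.25 - 4.0 * p         # discriminant of x**2 - x/2 + p
    if disc <= 0:
        # p > 1/16 is unattainable when c1 + c2 = 1/2; p = 1/16 gives
        # only the central point c_i = 1/4, which is excluded here.
        return []
    root = math.sqrt(disc)
    points = []
    for c1 in ((0.5 + root) / 2, (0.5 - root) / 2):
        c2 = 0.5 - c1
        points.append((c1, c2, c2, c1))
    return points


print(symmetric_stationary_points(4, 3 / 64))
print(symmetric_stationary_points(4, 1 / 2))
```

For $a = 4$, $R = 3/64$ the required product is $3/64$ and the quadratic has
roots $3/8$ and $1/8$, which gives the two points stated in the text. For
$a = 4$, $R = 1/2$ the required product is $1/2 > 1/16$, so the list is empty.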

The above results suggest that for $R = 1/2$ the internal point (75) is stable,
and even globally stable, in both the additive and multiplicative cases, but as
stated above the $c_i$ do not in the additive case always converge monotonically
to their final values as they do for a single locus. In this case the expression
$c_2c_3 - c_1c_4$ can also increase from an initial value of zero.

It is also curious to note that in the multiplicative case it is possible for
$c_2c_3 - c_1c_4$ to increase in absolute value, but if it is initially zero,
it remains zero. This can be shown by an interesting identity pointed out to me
by Professor K. Mahler. Multiplying the equations (82)-(85) together in pairs
and subtracting, we get

(92)  $(c_2'c_3' - c_1'c_4')\,W^2 = (c_2c_3 - c_1c_4)(\Omega - R u_2 v_2 W),$

where

(93)  $\Omega = u_1u_2(v_1c_1 + v_2c_2)(v_2c_1 + v_3c_2) + u_2u_3(v_1c_3 + v_2c_4)(v_2c_3 + v_3c_4)$
      $\quad + (u_1u_3 + u_2^2)v_1v_2c_1c_3 + (u_1u_3 + u_2^2)v_2v_3c_2c_4$
      $\quad + (u_1u_3v_1v_3 + u_2^2v_2^2)(c_1c_4 + c_2c_3).$

It is easy to verify from this that it is possible to have
$\Omega - R u_2 v_2 W > W^2$, so that
$|c_2'c_3' - c_1'c_4'| > |c_2c_3 - c_1c_4|$. On the other hand, if
$c_2c_3 - c_1c_4 = 0$, then $c_2'c_3' - c_1'c_4' = 0$.
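
The identity (92)-(93) can be checked in exact rational arithmetic. The sketch
below is illustrative only. It assumes the usual two-locus recurrence with
multiplicative fitnesses $u_iv_j$, gametes ordered AB, Ab, aB, ab, and a
recombination term proportional to $R u_2 v_2 (c_1c_4 - c_2c_3)$. This form is
an assumption here, because equations (82)-(85) are not reproduced in this
passage, and all function names are hypothetical.

```python
from fractions import Fraction as Fr


def marginals(c, u, v):
    """Marginal fitnesses of the gametes AB, Ab, aB, ab under w = u_i * v_j."""
    c1, c2, c3, c4 = c
    u1, u2, u3 = u
    v1, v2, v3 = v
    A, B = v1 * c1 + v2 * c2, v1 * c3 + v2 * c4      # sums weighted by v1, v2
    A2, B2 = v2 * c1 + v3 * c2, v2 * c3 + v3 * c4    # sums weighted by v2, v3
    return (u1 * A + u2 * B, u1 * A2 + u2 * B2, u2 * A + u3 * B, u2 * A2 + u3 * B2)


def step(c, u, v, R):
    """One generation of the ASSUMED recurrence; returns (c', W)."""
    m = marginals(c, u, v)
    W = sum(ci * mi for ci, mi in zip(c, m))          # mean fitness
    k = R * u[1] * v[1] * (c[0] * c[3] - c[1] * c[2])  # recombination term
    signs = (-1, 1, 1, -1)
    new = tuple((ci * mi + s * k) / W for ci, mi, s in zip(c, m, signs))
    return new, W


def omega(c, u, v):
    """The quantity Omega of (93)."""
    c1, c2, c3, c4 = c
    u1, u2, u3 = u
    v1, v2, v3 = v
    return (u1 * u2 * (v1 * c1 + v2 * c2) * (v2 * c1 + v3 * c2)
            + u2 * u3 * (v1 * c3 + v2 * c4) * (v2 * c3 + v3 * c4)
            + (u1 * u3 + u2 ** 2) * (v1 * v2 * c1 * c3 + v2 * v3 * c2 * c4)
            + (u1 * u3 * v1 * v3 + u2 ** 2 * v2 ** 2) * (c1 * c4 + c2 * c3))


def identity_sides(c, u, v, R):
    """Left and right sides of (92) for the given state and parameters."""
    new, W = step(c, u, v, R)
    lhs = (new[1] * new[2] - new[0] * new[3]) * W ** 2
    rhs = (c[1] * c[2] - c[0] * c[3]) * (omega(c, u, v) - R * u[1] * v[1] * W)
    return lhs, rhs


# Arbitrary illustrative values, not taken from the paper.
c = (Fr(1, 2), Fr(1, 6), Fr(1, 4), Fr(1, 12))
u = (Fr(1), Fr(2), Fr(3, 2))
v = (Fr(3), Fr(1), Fr(2))
lhs, rhs = identity_sides(c, u, v, Fr(1, 5))
print(lhs == rhs)
```

Under the same assumptions, a state of product form such as
$(1/12, 1/4, 1/6, 1/2)$ has $c_2c_3 - c_1c_4 = 0$ and keeps it at zero after
one step. With $u = (1, 1, 1)$, $v = (1, 4, 4)$, $R = 0$ and
$c = (1/2, 1/20, 2/5, 1/20)$ one finds $\Omega = 26/5$ and $W^2 = 2.4649$, an
instance of $\Omega - R u_2 v_2 W > W^2$.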

10.  Final  remarks 

It  will  be  seen  that  much  remains  to  be  done  in  this  subject.  We  need  to 
understand much better the general theory of section 6 and the algebraic structure
which lies behind it. The behavior of the relatively simple case of two loci
each  with  two  alleles  also  needs  a  great  deal  more  study. 

Applications  of  these  equations  to  particular  problems  have  been  made.  Thus, 
Kimura  [8]  has  studied  a  system  in  which  selection  leads  to  closer  linkage 
between  two  loci,  and  Ewens  (personal  communication)  has  used  (49)-(52)  to 
study  the  evolution  of  dominance. 

Applications to human disease are, perhaps, even more interesting.
Incompatibility between blood groups can cause the death of a foetus, and furthermore,
there is now evidence that a considerable proportion of fertilized eggs never result
in  a  pregnancy,  and  that  quite  large  selective  differences  in  this  respect  exist. 
There  is  even  evidence  that  existence  of  an  ABO  incompatibility  can  inhibit  the 
effects  of  an  Rh  incompatibility.  This  type  of  phenomenon  may  well  lead  to  an 
explanation  of  some  of  the  puzzling  features  of  the  genetics  of  schizophrenia 
(Moran  [17]).  This  is  a  mental  disease  of  young  adults  for  which  there  is  now 
substantial evidence of a genetic basis, although its exact mode of inheritance
is  not  known.  The  overt  expression  of  the  disease  has  undoubtedly  a  negative 
selective  effect,  but  the  underlying  genetic  polymorphism  must  be  very  stable 
since  the  frequency  of  the  disease  does  not  vary  greatly  between  different 
countries  (although  there  are  large  variations  between  different  social  classes). 
Calculations based on simple models suggest that if the polymorphism is
maintained by selective advantages in heterozygotes, these will have to be at least of
the  order  of  5%,  and  more  probably  10%.  The  puzzle  is  to  understand  how  so 
large  a  selective  advantage  can  come  about.  If,  however,  the  presence  of  a  factor 
for  schizophrenia  can  inhibit  one  of  the  incompatibilities,  such  as  those  for  blood 
groups,  which  result  in  such  a  large  loss  of  fertilized  eggs,  the  polymorphism 
would  be  explained. 

It  will  then  be  necessary  to  study  the  stability  of  systems  like  (49)-(52)  with 
more  than  two  alleles  at  each  locus.  However,  further  complications  are  likely 
to  arise  in  this  and  similar  problems  of  human  genetics,  for  the  definition  of  the 
fitnesses $w_{ij}$ conceals strongly simplifying assumptions; it is probable that it will
be  necessary  to  define  “fitness”  as  a  function  of  both  parental  genotypes  in  a 
mating.  This  leads  to  a  much  more  complicated  theory. 

Problems  of  a  different  kind  arise  when  we  attempt  to  fit  a  model  of  the  type 
described  by  (49)-(52)  to  observed  data.  If  we  can  observe  in  a  large  population 
the relative numbers of reproductive adults of the nine genotypes AABB, ...,
aabb, we obtain eight independent quantities. Supposing the polymorphism to be
stationary, we have eleven quantities to estimate: the eight independent ratios
of the $w_{ij}$, and three independent quantities out of $c_1$, $c_2$, $c_3$, $c_4$. These eleven
quantities  are  all  identifiable  because  the  assumption  of  stationarity  used  in  their 
estimation  implies  three  independent  relations  between  them  (that  is,  three  of 
the  equations  (49)-(52)).  We  could  then  go  further  and  test  the  resulting  system 
for  stability  by  the  criterion  of  section  8,  but  the  effect  of  sampling  variations  on 
this  is  not  known. 

I  am  indebted  to  Professors  S.  Karlin  and  K.  Mahler,  and  Drs.  H.  Cann, 
A.  W.  Edwards,  and  W.  J.  Ewens  for  some  helpful  criticisms  and  suggestions. 